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(54) Process and system for generating stereoscopic images from monocular images 



(57) To generate, from at least on monocular image 
(12), at least one image with stereoscopic effect (18, 
20), a depth map is provided (14) of said monocular im- 
age (a). The depth map in question defines a mapping 
of the objects present in the monocular image in relation 
to their spatial position in the direction of depth. For the 
pixels included in the monocular image and as a function 
of the depth map, respective parallax values are deter- 
mined (1 6) indicative of a respective quantity of move- 
ment by which each pixel must be moved in the image 



with stereoscopic effect (18, 20) in relation to the mo- 
nocular image (12). A movement corresponding to the 
respective quantity thus defined is then applied to the 
pixels of the monocular image (1 2), if necessary dividing 
it between two images of a stereoscopic pair (18, 20). 
Post-processing (22) of the image generated is also en- 
visaged, to eliminate artifacts such as superimposed ar- 
eas between moved pixels, a lateral edge of the image 
or gaps between objects included in said image gener- 
ated as a result of the aforesaid movement of pixels. 
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Description 

Field of the invention 

[0001 ] The present invention relates to the art of ster- 
eoscopic vision, and in particular tackles the problenn of 
reconstructing the stereoscopic vision of objects from 
the availability of monocular vision of these objects and 
the relative depth map. 

[0002] When a photograph is acquired, for example 
using a camera, the photo provides a two-dimensional 
image of the scene in which the depth information and 
distance between objects (or the different regions of the 
scene) are not present. By looking at these two-dimen- 
sional images, an observer may virtually reconstruct the 
depth information through previously acquired knowl- 
edge of the world represented in this photo. 
[0003] Various factors come into play in this recon- 
struction mechanism, such as retinal disparity, parallax 
and interaxial distance. 

[0004] The concept of retinal disparity can be easily 
understood with reference to a simple experiment: hold 
the thumb of one hand in front of your face and observe 
it. 

[0005] When observation is concentrated on the 

thumb, the eyes of the observer converge on this thumb, 
so that the optical axes of both eyes go through the 
thumb. Various muscles of the head move the eyes to 
obtain this result, arranging the images of the thumb on 
each fovea or central part of each retina. If, continuing 
to make the eyes converge on the thumb, the back- 
ground is observed, it is seen that the background 
seems to double. 

[0006] However, if the eyes are concentrated on the 
background, the eyes converge on the background and 
it is now the thumb that is seen with a double image. 
[0007] If the images that form on the retina of the right 
eye and on the retina of the left eye could be taken and 
superimposed, approximately the same scene would be 
obtained almost superimposed (right and left perspec- 
tive view points), and the difference between these im- 
ages is called "disparity". Disparity is the distance, in a 
horizontal direction, between the corresponding right 
and left points of the superimposed retinic images. The 
corresponding points of the retinal images of an object 
on which the eyes are converged have zero disparity. 
[0008] For further information on the principles and 
criteria at the basis of stereoscopic vision it is useful to 
refer to the website http://iglasses.weirdoz.org/docs/ 
handbook.pdf corresponding to the Stereograph ic De- 
velopers' Handbook. 

[0009] Retinal disparity is caused by the fact that each 
eye sees the world from a different view point. Eyes are, 
on average in adults, spaced apart by about 64 mm (this 
distance between the right and left eyes is called base 
line or interocular distance). 

[001 0] Disparity is processed by the eye-brain system 
into a single image of the visual world. The capacity of 



the mind to unite two different images, even if similar, 
into one single image is called fusion, and the resulting 
sense of depth is called stereopsis. 
[0011] Other important concepts in order to under- 

5 stand the mechanism of stereoscopic vision are parallax 
and disparity. These are similar entities, with the differ- 
ence given by the fact that the parallax is measured on 
the screen and the disparity is measured on the retina. 
It is the parallax that produces retinal disparity and the 

10 disparity in turn which produces stereopsis. Parallax 
may also be given in terms of angular measurements, 
which connects it to the disparity considering the dis- 
tance of the observer from the viewing screen. Parallax 
is an important entity, as it produces the stereoscopic 

f5 sense of depth. 

[0012] The different types of parallax that can be en- 
countered when observing a viewing screen are classi- 
fied hereunder. 

[0013] In the case in which the object is behind the 
20 projection plane the conditions represented in figure 1 
are found. 

[0014] In this figure, the pointto project, indicated with 
P, if found "behind" the projection plane S, or on the op- 
posite side in relation to the left and hght eyes, indicated 
25 with LE and RE respectively. 

[0015] Projection for the left eye LE is to the left and 
projection for the right eye RE is to the right. The dis- 
tance between the right and left projections of the eye 
is called horizontal parallax PO. As the projections are 
30 found on the same side as the eyes, it is called positive 
parallax. The maximum positive parallax PO is obtained 
when the object is infinite. At this point, the horizontal 
parallax PO is equal to the base line or interocular dis- 
tance. 

35 [0016] If the object or point to project P is positioned 
further forward in relation to the projection plane S the 
conditions represented in figure 2 are produced. Projec- 
tion for the left eye LE is to the right and projection for 
the RE eye is to the left, in this case conditions of neg- 
40 ative horizontal parallax OP are found. A negative hor- 
izontal parallax OP equal to the base line or interocular 
distance is obtained when the object is at a distance 
equal to half the distance between the projection plane 
S and the centre of the axis of the eyes. When the object 
45 moves towards the observer the negative horizontal 
parallax OP increases infinitely. 

[0017] If an object is on the projection plane S (figure 
3), relative projection on the focal plane coincides for 
both the left LE and right RE eyes. The homologous 
50 points of the two images correspond exactly or are one 
on top of the other. When the eyes of the observer are 
looking at the viewing screen S (or the eyes converge 
on the plane of the screen) images are being observed 
with zero parallax. 
55 [0018] In this case too, for more detailed information 
reference can be made to the website http://astronomy. 
swind.edu .au/pbourke/stereographics/st ereorender 
and in particularto the Calculating Stereo Pairs section. 
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[0019] The distance between the objectives used to 
take a stereoscopic photograph is called interaxial dis- 
tance, as regards the axes of the lenses. The axis of the 
objective is the line that can be drawn through the optic 
centre of the objective and is perpendicular to the plane 
of the surface on which the innage is fornried. 
[0020] Whether this is a real objective or the construc- 
tion of an objective is being used to generate an image 
generated by the connputer, the concept is the sanne. If 
the objectives are near, the stereoscopic effect of depth 
is reduced. In the extreme case in which the two objec- 
tives (and axes) coincide, the final result is a planar im- 
age. By increasing the distance between the objectives, 
both the parallax and the stereoscopic effect increase. 
In this regard reference may again be made to the Ste- 
reographic Developers' Handbook website, mentioned 
above. 

[0021] In general the interaxial distance may be con- 
sidered as coinciding with the interocular distance when 
the eye is considered as an image acquiring system. 

Description of the prior art 

[0022] In numerous applicative contexts (for example 
in medicinal and scientific, physical and constructional 
applications, and in the entertainment, information and 
education sectors) there is an interest to produce, as a 
characterizing element for viewing, a three-dimensional 
effect, in particular obtaining this effect on a viewing de- 
vice, such as a screen of various types, which itself has 
two-dimensional or, at least, substantially two-dimen- 
sional properties. 

[0023] For example, WO-A-01/3951 2 describes a de- 
vice destined to convert a two-dimensional video image 
into a three-dimensional video image. The device com- 
prises means to correct the parallax of each area calcu- 
lated by means to calculate the parallax on the basis of 
the magnitude of a moving vector. This solution is used 
to avoid the disadvantage caused by the fact that the 
stereoscopic sense of the video signal obtained by con- 
version is strongly influenced, with the same input video 
signal, as a function of the criteria employed to convert 
the video signal. In particular, the document in question 
refers to prior art procedures such as MTD (Modified 
Time Difference) and CID (Computed Image Depth). 
When a depth estimate is converted into a parallax, the 
estimate in question is subjected to a conversion of the 
distances scale so as to suppress the distortion of the 
image resulting from conversion and to determine, at 
least at tentative level, a target phase for each calcula- 
tion area of the parallax. A dynamic field is identified in 
which the differences in phase between the parallax cal- 
culation areas are within an admissible change of dis- 
tortion, producing a conversion of the distances scale, 
also determining, again at tentative level, a target phase 
and repeating this processing sequence. 
[0024] In more generic terms, the general problem 
linked to these conversion or construction techniques is 



given by the need to reconstruct the correct retinal dis- 
parity of the objects present in the scene, on the basis 
of the interaxial distance of the observer and the desired 
parallax effect. 

5 

Objects and summary of the present invention 

[0025] The object of the invention is thus to provide a 
solution for the generation of stereoscopic images from 

10 monocular images capable of giving rise to extremely 
satisfactory results from the viewpoint of quality. All this 
giving rise to a processing system architecture that al- 
lows easy implementation at a physical level, in partic- 
ular at the level of hardware devices. 

15 [0026] According to the present invention, this object 
is attained thanks to a process with the characteristics 
indicated specifically in the claims hereunder. 
[0027] The invention also relates to the relative sys- 
tem. 

20 [0028] In brief, in the currently preferred embodiment, 
the solution according to the invention allows the gen- 
eration of the stereoscopic pair from a single source im- 
age of the monocular type and from the relative depth 
information. 

25 [0029] This image or depth map is an image with lev- 
els of grey (for example with values from 0 to 255 - there- 
fore on eight bits), which represents the depth of each 
object within the source image. The conjunction of the 
source image and the depth map allow reconstruction 

30 of the binocular view, thus producing a three-dimension- 
al effect. 

[0030] A final filtering treatment makes it possible to 
solve the "occlusions", generated by introducing the 
parallax effect, without however introducing visible and/ 
35 or bothersome artefacts. 

[0031] This process is preferably performed on the 
basis of all the three-dimensional coordinates of the ob- 
jects inside the image. 

[0032] Each object, each dot, each figure is then iden- 
40 titled, divided, separated, calculated and so on, also 
knowing the parameters of the acquisition system (for 
example the parameters of the camera or television 
camera). 

[0033] These data can be used to reconstruct the 
45 world in three dimensions with the real distance be- 
tween objects, their occlusions and reproduction of the 
distortions of these objects. 

[0034] The lack of the specific information described 
above is solved by the availability of the aforesaid depth 
50 map. 

[0035] In the currently preferred embodiment, the so- 
lution according to the invention envisages the perform- 
ance of three classes of operations: acquisition, recon- 
struction (or processing) and viewing. 
55 [0036] With regard to the acquisition phase, the solu- 
tion according to the invention is susceptible to prescind 
from the input information referring to the concept of the 
depth map, which is susceptible to provide at least the 



3 



5 



EP1 353 518 A1 



6 



form, the position of the objects and the relative depth 
in the scene. 

[0037] In the reconstruction phase all the infornnation 
input into the system is processed, the right and left 
views are generated, the problems of occlusion and su- 
perimposition between different objects caused by the 
introduction of the parallax are solved, the best condi- 
tions of distance from the screen for the user to obtain 
ideal vision of the three-dimensional effect are suggest- 
ed. 

[0038] All this tal<ing account of two important ele- 
ments, namely independence from the different quality 
and content of the depth map input into the system, and 
lastly the effect of depth perception selected by the user. 
[0039] In the viewing phase it is possible to select the 
type of three-dimensional format to be produced. 
[0040] By way of an example, this selection may in- 
clude the choice of anaglyphs. 

[0041] This derives mainly from the fact that three-di- 
mensional vision depends heavily on the viewing tech- 
nology used: holographic, stereoscopic, volumetric and 
so on. 

[0042] Use of an anaglyph has the advantage of al- 
lowing an easy and low cost solution to the problem, 
even in the case in which reproduction is performed on 
a paper medium. 

Brief description of the accompanying drawings 

[0043] The invention shall now be described, purely 
as a non-limiting example, with reference to the accom- 
panying drawings, in which: 

Figures 1 to 3, relative to definition of the concept 
of parallax, have already been described. 
Figure 4, divided into two portions indicated with a 
and b respectively, represents two possible exam- 
ples of depth maps. 

Figure 5 is a diagram illustrating the perception of 
the sense of depth, 

Figure 6 illustrates, in the form of a block diagram, 
the possible architecture of a system according to 
the invention. 

Figure 7 illustrates, again in the form of a block di- 
agram, a possible architecture of a system for cal- 
culating the parallax. 

Figures 8 and 9 illustrate in greater detail the rela- 
tive calculation geometry; in particularfigure 9 illus- 
trates the geometry in the case in which the images 
of the stereoscopic pair are constructed taking ac- 
count of the original view point, or the view point at 
the time of acquisition of the monocular image, ex- 
actly in the centre in relation to the new view points, 
thus requiring the parallax to be distributed in equal 
parts between the two final images of the stereo- 
scopic pair. 

Figure 1 0 illustrates the mechanism for moving data 
to the left and right as a function of the parallax val- 



ues allocated to the pixels treated and the conse- 
quent presence of black lateral bars (due to the lack 
of real data) in the positions opposite to the move- 
ment implying reduction in the dimension of the im- 

5 age. 

Figures 1 1 and 1 2 illustrate the aforesaid movement 
mechanism in the regions inside the final image, in 
particular where there is the need to manage prob- 
lems relative to reconstruction and superimposition, 
10 while solving the occlusions, 

Figure 13 illustrates the upper and lower limits of 
the parallax quantity admissible by the human vi- 
sion system. 

Figures 14 and 15 illustrate further geometries rel- 
15 atlve to calculation of the parallax, 

Figure 16 illustrates the area of application of the 
monotone function in defining the depth as a func- 
tion of the parallax, 

Figures 1 7 and 1 8 illustrate different monotone ori- 
20 entations in the choice of possible functions in the 
area described by figure 1 6 above, 
Figure 19 illustrates, relative to a possible orienta- 
tion, the ratio with the scale of possible values (from 
0 to 255) of permitted depth levels, and 
25 - Figure 20 illustrates, for comparison, an original 
two-dimensional image (a), the relative depth map 
(b) and corresponding anaglyphs (cto h) generated 
according to the invention. 

30 Detailed description of an exemplary embodiment of the 
invention 

[0044] To reconstruct the binocular or stereoscopic 
view of a single image, the first step is to define the 

35 source view point: in other words, to decide if the two 
final images (right and left) must be recreated from noth- 
ing having that source of reference, or if the source im- 
age is to be exactly one of the final images while only 
the other is to be reconstructed from it. 

40 [0045] This decision may greatly influence the arte- 
facts susceptible to be produced during application of 
the parallax value. In fact, during this phase the objects 
are reconstructed in a different position from the original 
one and this simultaneously causes three situations: 

45 

two objects originally near each other move away 

from each other, 

two objects originally near each other remain in their 
original positions, 
50 - two objects originally near each other are superim- 
posed. 

[0046] This movement is the direct function of the par- 
allax value associated to the objects. Therefore the 
55 greater the parallax value, the greater possible artefacts 
introduced to the final image will be (greater distance or 
greater superimposition). 

[0047] From this assumption, it is evident that the final 
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choice is between dividing tlie problenn of tine artefacts 
between both final images or allowing the problenn to be 
established on only one of the two images. In general, 
it may be thought that by dividing the problem into equal 
parts between the two images, it will be half visible on 
both images. It is also true that it is present constantly 
for the entire time. On the contrary, by leaving the prob- 
lem on only one image, it is only present on the recon- 
structed image, butvisibleonlyforhalf of thetime. Clear- 
ly between these two extremes, cases are also contem- 
plated in which by distributing the parallax into different 
parts for each of the two final images, but so that the 
total quantity of parallax remains the same, the artefacts 
will be visible at different times with different persist- 
ence. 

[0048] Normally, one may think that, as the initial two- 
dimensional image is a convergent projection of the 
three-dimensional environment, the first action is the 
one most suited for the final phase of reconstruction by 
means of applying the parallax. However this does not 
imply a restriction or differentiation of the conversion 
problem. 

[0049] The present invention hereunder proposes a 
solution to limit visibility of the artefacts, not by using 
special functions to masl< these but by variable modifi- 
cation of a reduction/enlargement factor of the depth 
perception inside the viewing device. 
[0050] By way of an example, it is possible to establish 
that the most simple example of artefact susceptible to 
be generated in an image is a so-called "hole" or "gap", 
or a part of surface in which image information is not 
available. Although in its original form it is represented 
by a single dot or pixel, due to introduction of the parallax 
along the horizontal axis this form may be transformed 
into a segment and, as a result of processing along the 
vertical direction, this form may also be extended to an 
area. This area will have different dimensions and forms 
for each object and for each parallax value associated 
to it. 

[0051 ] Whatever the choice adopted to generate ster- 
eoscopic images from monocular images it is necessary 
to calculate a parallax value to be applied. 
[0052] To do so some initial elements must be avail- 
able, such as: 

the dimension of the screen, 
the dimension of the image to be projected, 
the minimum distance of the user from the repro- 
duction screen, and 
the depth of the objects. 

[0053] The first two elements are useful in order to 
generate images with dimensions congruous with the 
device utilized for viewing. 

[0054] The third element can be easily calculated hav- 
ing the first two elements and using the viewing speci- 
fications of the human eye. 

[0055] For applications in which these values are not 



modified, it can be assumed that they will be fixed in the 
system simplifying the I/O and speeding up use during 
computation. 

[0056] The fourth (namely the depth of the objects) is 

5 instead an essential element on which reconstruction of 
the three-dimensional environmentfrom the two-dimen- 
sional projection is based. The present invention utilizes 
the concept of depth map for this purpose. 
[0057] The depth map, in practice the linl< existing be- 

10 tween two-dimensional and three-dimensional worlds 
(or what is sometimes defined two-and-a-half dimen- 
sions or 2D W), is a mapping of the objects present in 
an image in relation to their spatial position. 
[0058] This mapping may alternatively have two 

15 meanings. 

[0059] The first identifies the exact correspondence 
of the space of each dot of the image, and in this case 
may be defined as depth map. 

[0060] In the second case it is possible to define the 
20 distinction between objects present in the scene, with- 
out any correspondence with the direction of real depth. 
In this case, although in this regard the concept of depth 

map will be used hereunder without any distinction, it is 
more accurate to talk of disparity map. 
25 [0061] The morethe difference between the two maps 

tends to decrease, the more their meaning tends to 
overlap; for this reason figure 4 presents two possible 
examples of maps of this type. 

[0062] This therefore requires the definition of a sys- 
30 tem in which, to reconstruct a three-dimensional envi- 
ronment, the processing system does not depend on the 
type of map, but only on the data contained in this. 
[0063] Also taken into consideration within the system 
developed was the possibility of being able to vary the 
35 depth effect, according to the requests of a possible ex- 
ternal user, so that viewing is comfortable with any depth 
map input. 

[0064] In both cases, to provide an immediately un- 
derstandable example, this is an image with different 

40 levels of grey in which each area reproduced with a spe- 
cific level of grey (shown in figure 4 with different sec- 
tioning) identifies the positioning, within the extent of the 
image, of objects or of a group of objects t found in a 
certain distance range in relation to the observation 

45 point. 

[0065] In the specific case, both images of figure 4 
reproduce different depth maps of an image of a moun- 
tain panorama (essentially the view of a valley). The 
overall effect that can be deduced by observing the 

50 depth map is almost similar to the effect perceivable by 
observing a group of stage settings or by observing a 
mountain landscape at daybreak or dusk in conditions 
in which the reduced level of luminosity makes chromat- 
ic information difficult to perceive and, at the same time, 

55 the grazing illumination of the sun low on the horizon 
highlights the staggering, on various subsequent per- 
spective planes, of the raised areas included in the land- 
scape. 



5 



9 



EP1 353 518 A1 



10 



[0066] For clarity it is stressed tliat tine present inven- 
tion does not deal with the problenn of generating the 
depth map of the image reproduced. It will thus be pre- 
sumed that this map is in any case available hereunder 

in the present description. 

[0067] When we observe the environment around us, 
it is evident that the geometrical forms of nearby objects 
and their relative distances are clearly visible, while this 
tends not to be the case for distant objects. 
[0068] The phenomenon is linked to so-called depth 
perception and is inversely proportional to distance from 
the object in question. Therefore the problem is posed 
of how to correctly define a depth map that complies with 
this rule. 

[0069] Figure 5 illustrates a typical effect of perception 
of three-dimensional or 3D space, identifying, on the 
one hand, equidistant levels of depth EL and the relative 
perception PER. 

[0070] Figure 5 defines several levels of depth equi- 
distant from one another, so that the initial rule is not 
violated. This also means that a real depth map can be 
emulated with an appropriate disparity map. In other 
words, even if we do not know the real three-dimension- 
al position of the objects, but by judiciously distributing 
each object on different levels equidistant from one an- 
other, it is possible to generate - a - depth effect very 
similar to the real one: in this respect, it must again be 
mentioned that the object set by the solution according 
to the invention is to reconstruct three-dimensionality of 
the scene and not exactly the real one. 
[0071] As already mentioned, a inherent phenome- 
non in generating a stereoscopic image from a monoc- 
ular image is given bythefomnation of "holes" or "gaps", 
or groups of missing data of the reconstructed images 
caused by horizontal movement of the objects inside the 
image. 

[0072] These holes relate to the unknown data that 
belong to the objects included in the observed scene, 
but of which nothing is known: colour, luminosity and so 
on. These holes must therefore be filled in some man- 
ner, also taking account of the fact that this filling proc- 
ess inevitably causes the introduction of some artefacts. 
[0073] Therefore it is necessary to operate in an man- 
ner that makes these artefacts invisible. 
[0074] As the problem is linked to the fact that areas 
for which there are no known data are discovered, these 
holes are not easy to fill and, when performing this task, 
care must be taken not to generate visible problems. 
[0075] This task may be facilitated by analysis of the 
initial image in the temporal domain. As well as creating 
difficulties linked to the fact that temporal analysis of a 
sequence of images requires the memorization of a 
large number of images, this analysis might never reveal 
the original information that we are attempting to recon- 
struct. This is due, for example, to the fact that a certain 
object (or usually a certain portion of this) might never 
be visible during the entire sequence, irrespective of the 
number of images considered. 



[0076] Moreover, it must be borne in mind that, at least 
in some applications, there is only one image available 
as the initial image. 

[0077] These artefacts or holes are therefore a prob- 

5 lem that require an adequate solution. 

[0078] A technique that may be suggested to de- 
crease the presence and/or visibility of these artefacts 
is one that, by limiting as much as possible a magnitude 
of the holes on the final images and maintaining the vis- 

10 ual three-dimensional effect, fills or fills in these holes 
or gaps with the original information in the homologous 
positions. 

[0079] There are also solutions in principle more ef- 
fective than this. In general, however, these alternative 
15 techniques depend greatly on the final viewing technol- 
ogy. 

[0080] Figure 6 illustrates in the form of a block dia- 
gram the architecture of a system for the reconstruction 
of a three-dimensional scene according to the criteria 
20 set forth above. 

[0081] The aforesaid system, indicated as a whole 
with 1 0, comprises corresponding modules that perform 
the following functions: 

25 - reading of the two-dimensional image (input mod- 
ule 12), 

reading of the depth map, generally provided from 
the outside (input module 14), 
pre-processing for calculation of the parallax from 
30 the image read in module 12 and from the depth 
map read in module 14 (the module for calculation 
of the parallax is indicated with 16), 
generation of the left view (module 1 8), 
generation of the right view (module 20), 
35 - processing of the view generated in modules 18 and 
20 by truncating the edges and solving the occlu- 
sions (module 22), and 

generation of the three-dimensional output image 
(module 24). 

40 

[0082] Module 12 is composed mainly of data mem- 
orizing structures (of the known kind) in which the two- 
dimensional images destined to be placed at the dispos- 
al of the other processing blocks present in the system 
45 are loaded. No general limits are set on the quantity of 
the images that can be loaded. 

[0083] Hereunder in the present description reference 
will also be made, at least implicitly, to images that are 
acquired and processed in digital form. Those skilled in 
50 the art will in any case appreciate that the scope of the 
invention is in no way limited to this solution of acquisi- 
tion and processing. The solution according to the in- 
vention in fact is suitable to be implemented, at least in 
principle, making use of techniques to acquire and proc- 
55 ess signals in which these signals maintain, at least in 
part, the features of an analogue signal, even if sam- 
pled. 

[0084] A typical example of these processing meth- 
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ods is given by the techniques for processing innages 
based on the use of neural networks which, fortheir nna- 
trix structure, aresuitableto be coupled directly to image 
sensors which also have a matrix structure. 
[0085] Module 1 4 implements acquisition of the depth 
map connected to the image or images input into the 
system through module 12. As already mentioned, the 
present invention does not specifically relate to the gen- 
eration of the aforesaid depth map, which may be made 
available to the system described herein employing any 
prior art solution for this purpose. Clearly, the quantity 
of depth data loaded in module 14 must correspond to 
the quantity of source data loaded in module 12. 
[0086] One of the parameters to be considered in 
three-dimensional reconstruction is correlation between 
the quantity of parallax to obtain the three-dimensional 
effect and the quantity of pixels corresponding to it. 
[0087] As we saw in the introductory part of the 
present description, the parallax value as a function of 
the interocular distance must be converted into a spe- 
cific quantity of pixels to move data horizontally to the 
right and left. It is therefore necessary to determine the 
correct conversion function that considers the two sys- 
tems different to value the pixel/distance ratio (usually 
expressed in millimetres). 

[0088] At the end of the processing chain, the images 
will be viewed on the dedicated screen, with a fixed di- 
mension and fixed resolution. It may thus be assumed 
that, by observing the three-dimensional image with the 
full screen, the image must also have the same dimen- 
sion and resolution as the video. If this is not the case, 
the final image must be processed (according to prior 
art which does not require to be described herein) so 
that it is then aligned with these values. 
[0089] Nonetheless, by assuming full screen viewing, 
it is possible to easily assess the pixels/millimetres ratio 
with the following ratio: 

Pixel/mm = (Xp,.Yp,)/(W^„.H 

mm' 

where the input image has [X:Y] pixels and where 
the final screen has the dimensions [W:H]. 
[0090] The correct value for identification of the pix- 
els/millimetres ratio is expressed extremely well in the 
function viewed above. 

[0091] In view of the fact that the parallax is only ef- 
fective in the horizontal dimension (we are obviously 
talking of an image observed horizontally, considering 
in fact the eyes of the observer aligned on a horizontal 
axis), it is possible to simplify the aforesaid function as 
follows: 

pixel/mm = Xp^/W^^ 



[0092] To obtain the three-dimensional effect, the 
original pixel and its replica must be moved (horizontal- 



ly) by a quantity defined by the parallax value to be ap- 
plied. 

[0093] In particular, to obtain uniform distribution of 
the depth effect (see the considerations above in rela- 

5 tion to the depth perception), it is necessary to deter- 
mine the range of values that can be taken by the par- 
allax by calculating its minimum and maximum value. 
[0094] These values are expressed unequivocally 
and irrespective of the viewing technology: therefore, for 

10 example, in millimetres. As these movements must be 
viewed, they can be transformed into units of measure- 
ment relative to the image, that is the number of pixels. 
[0095] To do so the pixel to millimetre ratio previously 
calculated must be known. 

15 [0096] The diagram in figure 7 illustrates, in the form 
of a functional block diagram, the criteria used to calcu- 
late the final parallax. 

[0097] In particular, in the diagram of figure 7 refer- 
ences 26 and 28 illustrate the data (which can be seen 
20 as data files) containing the information relative to the 
interocular distance and the decrease in depth (file 26) 
and the image length, the image width and the diagonal 
dimension of the video (file 28). 

[0098] The block 30 represents the functional module 

25 which detemnines the distance and resolution, in partic- 
ular generating the information relative to the minimum 
user-screen distance (intended as the minimum dis- 
tance at which the image is viewed on the screen com- 
fortably for the human vision system, distances greater 

30 than this minimum value increase the comfort of viewing 
but give increasingly less depth perception) and the pix- 
els/millimetres ratio seen previously. 
[0099] From the information on the minimum user- 
screen distance, a module indicated with 32 calculates, 

35 on the basis of the data contained in file 26, the parallax, 
intended as maximum parallax. 
[01 00] The block 34 indicates the module destined to 
act as parallax motor destined to feed towards a module 
acting as movement motor, indicated with 36, the move- 

40 nnent information (right left) relative to the parallax. On 
the basis of this information, the module 36 generates 
the left image LI and the right image Rl destined to be 
sent to modules 1 8 and 20 of figure 6. 
[0101] The block or module 30 assesses two impor- 

45 tant parameters: the minimum distance recommended 
to the user to view the screen in comfort and the reso- 
lution of the image as pixel/millimetre ratio. 
[0102] The distance D of the user from the screen S 
is a function of the maximum aperture of the eye of the 

50 diagonal dimension of the video. Its value indicates the 
threshold below which the observer cannot view the im- 
ages comfortably. Clearly, by increasing the diagonal di- 
mension of the video the value of D increases conse- 
quently. Block or module 32 assesses the maximum par- 

55 allax value M that can be obtained in relation to the in- 
terocular distance, indicated with B, and to the depth 
effect desired (F(D)). 

[0103] This occurs on the basis of the geometry rep- 
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resented in figure 8, where once again the references 
LE and RE indicate the left eye and the right eye, re- 
spectively. 

[01 04] Block 34 perfornns transformation of the paral- 
lax from millimetres to number of pixels taking account 
of the pixel/millimetre ratio calculated previously. As a 
result it is possible to know exactly how much the source 
pixel must be moved to the right or left according to the 
relations 

parallaxpi^^, = ratio, parallax ^ni^^ter 

parallaxpf^^i = ^pixelpi^^i.prallax^ii„^^tJ 

[0105] Block 36 is instead the true movement motor, 
which must take account of the distribution of the total 
parallax value. 

[0106] Consequentto the considerations above, or by 
dividing the creation of the right and left image in two, 
the parallax value must also be divided into two parts. 

Each of these parts generally corresponds to a fraction 
of the overall movement and can therefore be either ex- 
tremely different from or equal to each other (for exam- 
ple, by being equal to half of the overall movement). 
[01 07] This takes place according to criteria schemat- 
ically represented in figure 9, corresponding in practice 
to the definition of a sort of virtual eye VE in the inter- 
mediate position between the left eye LE and the right 
eye RE. 

[0108] In particular, it is possible to recreate the two 
images even in a manner that is not equidistant for right 
and left. 

[0109] The following solutions are therefore possible, 
for example: 

CASE 1: 

[0110] 

source image or SOURCE = RIGHT 
new image or NEW = LEFT 

CASE 2: 

[0111] 

SOURCE/2 = RIGHT 
SOURCE/2 = LEFT 

CASE 3: 

[0112] In general 

SOURCE/A = RIGHT 
SOURCE/B= LEFT 



[0113] In such a way that (L/A + R/B) = 1 with A and 
B both equal to and different from each other. 
[0114] As soon as the parallax value of all the pixels 
of the image has been assessed, all the elements to 

5 generate binocular vision are available. 

[01 1 5] As long as the final destination of the pixel falls 
within the space identified by the image there are no par- 
ticular problems. However, as soon as this goes beyond 
the dimensions pre-established by the screen controls, 

10 this causes ambiguity that must be solved. 

[0116] In particular, by assuming that we have an 
identical parallax value for all the pixels belonging to the 
same line, in the final image some of these no longer 
exist and some positions are not covered by any original 

15 pixel. 

[01 1 7] This situation can be observed directly in figure 
10, where, in relation to a source line LS, it is possible 
to observe the effect of a movement to the left LM and 
a movement to the right RM, respectively. 
20 [01 18] In practice, it can be observed that in both cas- 
es a pixel in the end position of the line is taken out of 
the screen, while a hole is created corresponding to the 
internal end. 

[0119] In the final image, in addition to holes or gaps 
25 inside the image, a black edge is also present due to the 

lack of information to enter in the corresponding lateral 
bands. The dimension of this edge depends principally 
on the parallax. In particular, it depends on the maximum 
parallax value of those pixels that do not fall inside the 
30 image. 

[0120] Nonetheless, the dimension of the edges that 
remain blacked does not depend on the value of the 
maximum parallax applied to the image, but on the max- 
imum parallax that the pixels may have. In other words, 

35 the maximum parallax defines the extent of the maxi- 
mum depth effect of the screen . However, if no pixel has 
a value corresponding to the depth map, this value will 
never be reached. Therefore, there is a direct and pro- 
portional dependence with the content of the depth map. 

40 [0121] Returning to the phase to apply the parallax 
value of the pixels and extending the possibility of hav- 
ing different parallax values for adjacent pixels belong- 
ing to the same line or row of the video, or in the bound- 
ary regions between two portions of image with different 

45 depth and therefore parallax values, there may be cases 
in which different parallax values cause the lack of data 
with which to fill this distance. 

[0122] Figure 1 1 illustrates an example in which, after 
a left movement, it is no longer possible to establish and 

50 state with certainty which values the pixels to the right 
of D and to the left of E must have. 
[0123] There are different ways of reducing these 
problems: for example, as already mentioned, it is pos- 
sible to use the original information in the homologous 

55 positions of the original image. However, the larger the 
space defined by the hole or gap, the more difficult it is 
to reconstruct it correctly with no more information than 
that already considered. On the contrary, the smallerthe 
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empty space, the less the three-dimensional effect in- 
troduced will be. 

[0124] An opposite situation is found when the paral- 
lax values cause superimposition between original pix- 
els, requiring the introduction of a function to control the 
forward-backward relation on the basis of the parallax 
value in question. This situation is represented, again 
with reference to left movement, in figure 12. 
[0125] The aforesaid function can be described and 
implemented simply with reference to perspective ge- 
ometry, thanks to which it is known that the greater the 
parallax, the greater the distance the relative point is 
from the observer and therefore as these points cannot 
be visible they must be darkened by those with least par- 
allax in homologous positions. Consequently, forthis as- 
sumption to remain valid it is necessary to ensure that 
it is complied with in these cases in which superimposi- 
tion is found. 

[0126] Hereunder a solution is illustrated to calculate 
the parallax to apply for each original pixel of the image. 
[0127] The relative assumptions must be based on 
the following elements: image resolution, distance of the 
observer from the screen, screen dimension and inte- 
rocular distance of the observer. 
[01 28] Added to these elements is the depth map as- 
sociated to the source. As has already been seen, the 
depth map forms the connection between the two-di- 
mensional source at the three-dimensional output. 
[0129] Moreover, given an initial image a single depth 
map does not exist, meaning that, given a certain initial 
image, it is possible to reconstruct different, virtually in- 
finite, three-dimensional effects. 
[0130] Clearly, the greater the number of levels, the 
greaterthe effectthatcan be reconstructed will be. How- 
ever, it is also true that, having a high number of levels 
available, but requiring a limited three-dimensional ef- 
fect, these effects tend to collapse into themselves, so 
to speak, returning the solution to the initial point. 
[0131] Moreover, the fact that the distinction between 
depth and disparity map has been defined is important 
in order to reconstruct a three-dimensional effect irre- 
spective of this fact. 

[0132] In relation to the number of depth levels avail- 
able, it can be noted that, by assuming we have a map 
drawn at grey level on, for example, 8 bits, up to 256 
levels are available. This number may be excessive to 
describe, for example, an indoor scene, while on the oth- 
er hand it may not be sufficient for a panoramic scene. 
[0133] However, in this regard it must again be men- 
tioned that the object sought here is not to reconstruct 
the natural three-dimensional effect, but to manage a 
three-dimensional effect, capable of managing all types 
of map, thus capable of manipulating the three-dimen- 
sional effect as required: this feature is of particular im- 
portance, for example, in entertainment applications or 
medical and scientific applications. 
[0134] In this respect, with reference to figure 13 in 
which, once again, the references LE and RE indicate 



the left eye and right eye, respectively, separated by the 
interocular distance B. 

[0135] The value of the parallax can be seen as a 
function of the interocular distance B. This is mainly true 
5 when it is asserted that the parallax cannot be greater 
than B in the cases in which: 

a point A' is found in the space between the projec- 
tion plane S and infinity, and 
10 - apoint A" is found between the projection plane and 
half the distance between this plane and the observ- 
er, the two cases represented in figure 13 repre- 
senting limit positions of these conditions. 

15 [0136] More generally, whatever the position of the 
point inside the space defined above, as the human 
viewing system is originally convergent, this maximum 
parallax equal to B should never be reached. 
[01 37] By now dividing the system between observer- 

20 screen distance and screen-background of the screen 
distance, it can be stated that by fixing these two values, 
it is possible to obtain the maximum quantity of parallax 
applied. 

[0138] Assuming we define these quantities as iden- 
25 tical, it can be obtained that the maximum parallax is 
equal to half of the interocular distance, thus defining a 
convergent vision more comfortable for the human eye 
in the conditions represented in figure 14. 
[0139] Generalizing, a variable distance can be de- 
30 fined, with a minimum value equal to D, at which the 
observer must be positioned from the screen S, while it 
is possible to define the depth inside the screen as var- 
iable. 

[0140] Considering a system in which this depth, in- 
35 dicated with P, is equal to the value of the depth effect 
according to the diagram in figure 1 5 and exploiting the 
relation between similar triangles it follows that: 

40 B:(D+P) = M:P (5) 

where: 

M is the maximum parallax 
45 . B is the interocular distance 

P is the depth inside the screen, and 

D is the minimum distance of the observer from the 

screen. 

50 [0141] From the relation (5), the depth of the screen 
can be calculated as follows: 

M = (B.P)/(D+P) (6) 

55 

[01 42] The next step is to define the best scale func- 
tion to associate to each value of the depth map the cor- 
responding parallax value M to be associated to each 
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pixel. 

[0143] The area in wliich this function must be applied 
is linnited to the relation definable as "mininnum parallax 
to the nearest objects and maximum parallax to those 

farthest away". 

[01 44] The function that seems most reasonable con- 
sidering the concept of depth perception, is the increas- 
ing monotone linear function which connects the two 
ends of the area of application according to the criteria 
represented in figure 1 6 in which the axis of the abscis- 
sas represents a parallax value between a minimum val- 
ue Min and a maximum value Max and the scale of the 
ordinates corresponds to an ideal depth measurement 
between a minimum value, corresponding to 0 of the rel- 
ative scale of the ordinates, and a "far" value corre- 
sponding to the top end of the area of application of the 
aforesaid function. 

[0145] As is shown in figure 1 6, the maximum parallax 
is associated to the objects furthest away, while the min- 
imum parallax is associated to those nearest. 
[0146] It therefore becomes necessary to associate 
the value inside a depth map with the relative parallax, 
or better to define which of the two situations represent- 
ed in figures 1 7 and 1 8 correspond to the case in ques- 
tion. 

[0147] In actual fact, the problem is easy to solve, as 
irrespective of how the depth map associates the values 
of grey to the three-dimensional distance, it must be 
possible not to make the error, which in these cases 
means reconstructing each three-dimensionality back- 
wards. From the figures of the map and the concept that 
this map has been defined with levels of grey (for exam- 
ple from 0 to 255, or on 8 bits), it follows that the function 
to be implemented is the one represented in figure 19, 
where the scale of the ordinates represents the levels 
of grey. 

[01 48] It must still be defined where to make the max- 
imum and minimum parallaxes correspond in relation to 

the viewing system. For simplicity and without detriment 
to all generalization, it is possible to establish that the 
zero parallax corresponds to the viewing screen, while 
the maximum parallax coincides with the background of 
the system. 

[0149] In general, the experiences conducted by the 
Applicant show that it is not advantageous to introduce 
negative parallaxes as if a negative parallax were to be 
associated to objects which, being positioned partly out- 
side the viewing screen (for example, new objects en- 
tering the main scene) are not completely visible, these 
would jump out of the screen, and thus be incomplete 
and not comfortably observable. 
[0150] It is thus possible to easily define the effect of 
distance between the screen and the background on the 
basis of simple relations. 

[0151] For example, considering figure 1 9, it is possi- 
ble to write the linear function that defines the parallax 
as a function of the maximum parallax M and the depth 
value as follows: 



parallax = M.(1 -depth value/255) (7) 

and using the relation (6) set forth above the following 
5 is obtained: 

parallax = BP/ (D+P) . (1 -depth value/255) (8) 

10 [0152] Therefore by defining a function for which the 
maximum parallax is equal to: 

M = B/N with N equal to 2, 3, ... (9) 

15 

where N is the reduction factor, we can obtain: 
P= D/(N-1) with N equal to 2, 3, ... (10) 

20 

or a direct relation between the distance of the us- 
er from the screen and the depth effect inside the 
screen. 

[0153] The minimum value for N is 2 as a higher par- 
25 allax value is not possible. Therefore the maximum par- 
allax and the depth of the screen are made to depend 
on the reduction value N. A reference value of N to ob- 
tain comfortable three-dimensional vision can be ob- 
tained through experimental tests. 
30 [0154] Bearing in mind the relation (7) already seen 
above and using the relation (9) set forth above we can 
finally obtain: 

parallax = (B/N).(1 -depth value/255) (1 1) 

[0155] In this way the variability of the depth effect is 
connected to the programmable reduction factor N of 
the three-dimensional effect desired. The value N has a 

40 fundamental role in creating the occlusions in the con- 
struction process of the two right and left images so that 
as N decreases there is an increase in the magnitude 
of the holes between originally adjacent objects and an 
increase in the dimension of the lateral edge of the im- 

45 age that is blacked. 

[0156] Figure 20 indicates various possible results of 
three-dimensional reconstruction of an original image a) 
from a depth map indicated with b). 
[0157] In particular, the images indicated with c) to h) 

50 correspond to respective anaglyphs for N=4, N=5, N=6, 
N=7, N=8, N=9. 

[0158] Obviously, the lower the value of N , the greater 
the maximum parallax is, causing less comfortable 
three-dimensional vision of the anaglyph. It is obvious 
55 that, as a function of the application, it is possible to 
choose a correct compromise between the value of N 
and a comfortablethree-dimensional vision, in particular 
as regards the non-perception of holes and gaps. 
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[0159] Naturally, without detriment to the principle of 
the invention, the details of embodinnent and the fornns 
of innplementation may be widely varied in relation to the 
description and illustration provided herein, without 
however departing from the scope of the invention as 
defined in the appended claims. 



Claims 

1 . A process for generating, from at least one monoc- 
ular image (a), at least one image with stereoscopic 
effect (c to h), characterized in tliat it comprises 

the operations of: 

providing a depth map (b) of said monocular im- 
age (a), which defines a mapping of the objects 
present in the monocular image (a) relative to 
their spatial position in the direction of depth in 
relation to the monocular image, 
determining, for the pixels included in said mo- 
nocular image (a) and as a function of said 
depth map (b), the respective parallax values 
indicative of a quantity of movement for each 
pixel to obtain at least a stereoscopic effect (c 
to h) in relation to said monocular image (A), 
applying to the pixels of said monocular image 
(a) a movement corresponding to said quantity 
of movement so as to generate, through the ef- 
fect of said movement, at least one image with 
stereoscopic effect. 

2. Process according to claim 1, characterized in 
that, as a result of said movement of pixels, said 
image with stereoscopic effect (cto h) is susceptible 
to contain superimposed areas between moved pix- 
els and in that the procedure comprises the oper- 
ation of eliminating (22) from said image with ster- 
eoscopic effect (c to h) the superimposed areas 
maintaining, between superimposed pixels, the pix- 
els with least parallax. 

3. Process according to claim 1 or claim 2, character- 
ized in that, as the result of said movement of pix- 
els, said image with stereoscopic effect (c to h) is 
susceptible to present gaps corresponding to a lat- 
eral edge of the image and in that the procedure 
comprises the operation to mask (22) said lateral 
edge. 

4. Process according to any of the claims from 1 to 3, 
characterized in that, as the result of said move- 
ment of pixels, said image with stereoscopic effect 
(c to h) is susceptible to present gaps between ob- 
jects included in said image and in that the proce- 
dure comprises the operation to fill in said gaps with 
information obtained from pixels located in the ho- 
mologous and/or nearby positions in said monocu- 



lar image (a). 

5. Process according to any of the claims from 1 to 3, 
characterized in that, as the result of said move- 

5 ment of pixels, said image with stereoscopic effect 

(c to h) is susceptible to present gaps between ob- 
jects included in said image and in that the proce- 
dure comprises the operation to maintain, at least 
in part, in said image with stereoscopic effect said 
10 gaps between objects included in said image with 
stereoscopic effect. 

6. Process according to any of the preceding claims, 
characterized in that it includes the operation to 

15 allocate to said pixels of said monocular image (a) 
respective parallax values obtained from a maxi- 
mum parallax value (IVI) as a function of a weighting 
rule. 

20 7. Process according to claim 6, characterized in 
that said weighting function is a scalar function. 

8. Process according to claim 6, characterized in 
that said weighting function follows a linear weight- 

25 ing rule. 

9. Process according to any of the preceding claims, 
characterized In that it comprises the operation of 
allocated to said pixels of said monocular image (a) 

30 parallax values linked to a maximum parallax value 
(M) determined as a function of the interocular dis- 
tance (B). 

10. Process according to any one of the preceding 
35 claims, characterized in that it comprises the op- 
eration to allocate to said pixels of said monocular 
image (a) parallax values linked to a maximum par- 
allax value (M) determined as a function of the po- 
sitioning (P) of the respective object in the sense of 

40 depth within the extent of said monocular image as 
a function of said depth map. 

11. Process according to any of the preceding claims, 
characterized in that it comprises the operation to 

45 allocate to said pixels of said monocular image (a) 
parallax values linked to a maximum parallax value 
(M) determined as a function of a parameter of min- 
imum observation distance (D) of said monocular 
image with stereoscopic effect. 

50 

12. Process according to claims 9, 1 0 and 1 1 , charac- 
terized in that said maximum parallax value (M) is 
determined on the basis of the relation: 

IVI = (B.P)/(D+P) 

where: 
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M is said maximum parallax value, 

B is the interocular distance, 

P is said positioning of tine respective object in 

the direction of depth, and 

D is said parameter of minimum observation 5 
distance. 



claims, characterized in that it comprises the op- 
eration to determine said parallax values in terms 
of absolute distance and then convert said absolute 
distance into a value of number of pixels as a func- 
tion of the resolution desired in said image with ster- 
eoscopic effect. 



13. Process according to any one of the preceding 
claims, characterized in tliat it comprises the op- 
eration to allocate to said pixels of said monocular 
image (a) parallax values linked to a maximum par- 
allax value (M) determined on the basis of a relation 
of the type: 



20. Process according to any one of the preceding 
claims, cliaracterized in tliat said monocular im- 

10 age and said image with stereoscopic effect have a 
common horizontal direction and in tliat said re- 
spective quantity of movement is applied to the pix- 
els exclusively in said common horizontal direction. 



M = B/N, 

where 

B is the interocular distance, and 
N is a reduction factor. 

14. Process according to claim 5 and claim 13, char- 
acterized in tliat it comprises the operation to se- 
lectively detennine said reduction factor (N) so as 
to minimize the incidence of said gaps in informa- 
tion between adjacent objects present in said image 
with stereoscopic effect. 

15. Process according to claim 1 or claim 14, ciiarac- 

terized in that it comprises the operation to gener- 
ate said at least one image with stereoscopic effect 
in the form of anaglyph (c to h). 

16. Process according to any one of the preceding 
claims, characterized in that it comprises the op- 
eration to generate a pair of said images with ster- 
eoscopic effect, applying said respective quantity of 
movement of the pixels of said monocular image (a) 
when generating only one of said images with ster- 
eoscopic effect. 

17. Process according to any of the claims from 1 to 15, 
characterized in that it comprises the operation to 
generate a pair of said images with stereoscopic ef- 
fect, applying to said pixels of said monocular image 
(a) respective movements correlated to said quan- 
tity of movement when generating both of said im- 
ages with stereoscopic effect. 

18. Process according to claim 17, characterized in 
that it comprises the operation to apply to said pix- 
els of said monocular image, when generating said 
two images with stereoscopic effect, respective 
movements equal to fractions (A, B) of said quantity 
of movement. 

19. Process according to any one of the preceding 



15 21 . A system for generating, from at least one monoc- 
ular image (a), at least one image with stereoscopic 
effect (c to h), characterized in that it comprises: 

at least one input module (12, 14)) configured 
20 to read said monocular image (a) and a depth 

map (b) of said monocular image (a), said depth 
map (b) defining a mapping of the objects 
present in said monocular image (a) relative to 
their spatial position in the direction of depth in 
25 relation to said monocular image, 

a processing module (1 6) to determine, for the 
pixels included in said monocular image (a), 
and as a function of said depth map (b), respec- 
tive parallax values indicative of a respective 
30 quantity of movement by which each pixel must 

be moved in said image with stereoscopic ef- 
fect (c to h) in relation to said monocular image 
(a), said processing module (1 6) applying to the 
pixels of said monocular image (a) a movement 
35 corresponding to said respective quantity so as 

to generate, through the effect of said move- 
ment, said image (1 8, 20, 24) with stereoscopic 
effect. 

40 22. System according to claim 21 , characterized in 
that, as the result of said movement of pixels, said 
image with stereoscopic effect (c to h) is susceptible 
to contain superimposed areas between moved pix- 
els and in that the system comprises a further 

45 processing module (22) configured to eliminate 
from said image with stereoscopic effect (cto h) said 
superimposed areas maintaining, between the su- 
perimposed pixels, the pixels with least parallax. 

50 23. System according to claim 21 or claim 22, charac- 
terized in that, as a result of said movement of pix- 
els, said image with stereoscopic effect (c to h) is 
susceptible to present gaps corresponding to a lat- 
eral edge of the image and in that the system com- 
55 prises a further processing module (22) configured 
to mask (22) said lateral edge. 

24. System according to any one of the claims from 21 
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to 23, characterized in that, as a result of said 
movement of pixels, said image with stereoscopic 
effect (c to h) is susceptible to present gaps be- 
tween objects included in said image and in that 
the system comprises another processing module 5 
(22) configured to fill in said gaps with information 
obtained from the pixels situated in the homologous 
positions in said monocular image (a). 



mined as a function of a parameter of minimum ob- 
servation distance (D) of said image with stereo- 
scopic effect. 

32. System according to claims 29, 30 and 31 , charac- 
terized in that said processing module (1 6) is con- 
figured to determine said maximum parallax value 
(M) on the basis of the relation: 



25. System according to any of the claims from 21 to io 
23, characterized in that, as a result of said move- 
ment of pixels, said image with stereoscopic effect 

(c to h) is susceptible to present gaps between ob- 
jects included in said image and in that the system 
comprises another processing module (22) config- 15 
ured to maintain, at least in part, in said image with 
stereoscopic effect said gaps with information ob- 
tained from the pixels situated in the homologous 
positions in said monocular image (a). 

20 

26. System according to any of the preceding claims 21 
to 25, characterized in that said processing mod- 
ule (16) is configured to allocate to said pixels of 
said monocular image (a) respective parallax val- 
ues obtained from a maximum parallax value (M) 25 
as a function of a weighting rule. 

27. System according to claim 26, characterized in 
that said processing module (16) is configured to 
apply, as said weighting function, a scalar function. 30 



M = (B.P)/(D+P) 

where: 

M is said maximum parallax value, 

B is the interocular distance, 

P is said positioning of the respective object in 

the direction of depth, and 

D is said parameter of minimum observation 

distance. 

33. System according to any one of the preceding 
claims from 21 to 32, characterized in that said 
processing module (16) is configured to allocate to 
said pixels of said monocular image (a) parallax val- 
ues linked to a maximum parallax value (M) deter- 
mined on the basis of a relation of the type: 

IVI = B/N, 



28. System according to claim 26, characterized in 
that said processing module (16) is configured to 
apply said weighting function following a linear 
weighting rule. 



where 

B is the interocular distance, and 
35 - N is a reduction factor. 



29. System according to any one of the preceding 
claims 21 to 28, characterized in that said 

processing module (1 6) is configured to allocate to 
said pixels of said monocular image (a) parallax val- 40 
ues linked to a maximum parallax value (M) deter- 
mined as a function of the interocular distance (B). 

30. System according to any one of the preceding 
claims 21 to 29, characterized in that said 45 
processing module (1 6) is configured to allocate to 
said pixels of said monocular image (a) parallax val- 
ues linked to a maximum parallax value (M) deter- 
mined as a function of the positioning (P) of the re- 
spective object in the direction of depth of the extent so 
of said monocular image as a function of said depth 
map. 

31. System according to any one of the preceding 
claims 21 to 30, characterized In that said 55 
processing module (1 6) is configured to allocate to 
said pixels of said monocular image (a) parallax val- 
ues linked to a maximum parallax value (M) deter- 



34. System according to claim 25 and to claim 33, char- 
acterized in that said processing module (16) is 

configured to selectively determine said reduction 
factor (N) so as to minimize the incidence of said 
gaps in information between adjacent objects 
present at least in said image with stereoscopic ef- 
fect. 

35. System according to claim 21 or claim 34, charac- 
terized in that said processing module (1 6) is con- 
figured to generate said at least one image with 
stereoscopic effect in the form of anaglyph (c to h). 

36. System according to any one of the preceding 
claims from 21 to 35, characterized In that said 
processing module (1 6) is configured to generate a 
pair (1 8, 20) of said images with stereoscopic effect, 
applying said respective quantity of movement of 
the pixels of said monocular image (a) when gen- 
erating only one of said images with stereoscopic 
effect. 
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37. System according to any one of the preceding 
claims from 21 to 35, characterized in tliat said 
processing module (1 6) is configured to generate a 
pair of said images with stereoscopic effect (18, 20), 
applying to said pixels of said at least one monoc- 5 
ular image (a) respective movements correlated to 
said quantity of movement when generating both of 
said images with stereoscopic effect. 

38. System according to claim 37, characterized in io 
that said processing module (16) is configured ap- 
ply to the pixels of said at least one monocular im- 
age (a), when generating said two images with ster- 
eoscopic effect (18, 20), respective movements 
equal to fractions (A, B) of said quantity of move- 15 
ment. 

39. System according to any one of the preceding 
claims from 21 to 38, characterized in that said 
processing module (16) is configured to determine 20 
said parallax values in terms of absolute distance 
and to then convert said absolute distance into a 
value of number of pixels as a function of the reso- 
lution desired in said image with stereoscopic ef- 
fect. 25 

40. System according to any one of the preceding 
claims from 21 to 39, characterized in that said 
monocular image and said image with stereoscopic 
effect have a common horizontal direction and in 30 
that said processing module (16) is configured to 
apply to the pixels of said monocular image (a) said 
respective quantity of movement exclusively in said 
common horizontal direction. 

35 
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